Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 1118122 |
| Missing cells | 194456 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 136.5 MiB |
| Average record size in memory | 128.0 B |
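The percentage and memory figures in this table can be cross-checked against one another. The sketch below recomputes them from the reported counts; the 8 bytes per value is an assumption (typical for 64-bit numeric columns and object pointers in pandas), not something the report states.

```python
# Figures copied from the Dataset statistics table.
n_variables = 16
n_observations = 1_118_122
missing_cells = 194_456

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: every value is stored in 8 bytes.
bytes_per_value = 8
record_bytes = n_variables * bytes_per_value
total_mib = record_bytes * n_observations / 2**20

print(f"missing cells: {missing_pct:.1f}%")
print(f"record size: {record_bytes} B, total: {total_mib:.1f} MiB")
```

Under that assumption the recomputed values agree with the reported 1.1%, 128.0 B and 136.5 MiB after rounding.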
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| time has a high cardinality: 45406 distinct values | High cardinality |
| gameId is highly correlated with team | High correlation |
| frameId is highly correlated with s and 1 other fields | High correlation |
| s is highly correlated with dis | High correlation |
| a is highly correlated with s | High correlation |
| dis is highly correlated with s | High correlation |
| team is highly correlated with gameId | High correlation |
| nflId has 48614 (4.3%) missing values | Missing |
| jerseyNumber has 48614 (4.3%) missing values | Missing |
| o has 48614 (4.3%) missing values | Missing |
| dir has 48614 (4.3%) missing values | Missing |
| s has 71193 (6.4%) zeros | Zeros |
| a has 66378 (5.9%) zeros | Zeros |
| dis has 70153 (6.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-02 15:07:08.588089 |
|---|---|
| Analysis finished | 2022-11-02 15:08:47.232478 |
| Duration | 1 minute and 38.64 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
gameId
Real number (ℝ≥0)
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2021091189 |
| Minimum | 2021090900 |
| Maximum | 2021091300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 2021090900 |
|---|---|
| 5-th percentile | 2021090900 |
| Q1 | 2021091202 |
| median | 2021091206 |
| Q3 | 2021091210 |
| 95-th percentile | 2021091300 |
| Maximum | 2021091300 |
| Range | 400 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 90.53526954 |
|---|---|
| Coefficient of variation (CV) | 4.479524232 × 10⁻⁸ |
| Kurtosis | 5.740632478 |
| Mean | 2021091189 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -2.521368112 |
| Sum | 2.259826522 × 10¹⁵ |
| Variance | 8196.635031 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2021090900 | 92644 | 8.3% |
| 2021091300 | 92299 | 8.3% |
| 2021091204 | 82271 | 7.4% |
| 2021091201 | 76406 | 6.8% |
| 2021091205 | 75739 | 6.8% |
| 2021091212 | 72266 | 6.5% |
| 2021091203 | 71116 | 6.4% |
| 2021091200 | 69299 | 6.2% |
| 2021091202 | 68172 | 6.1% |
| 2021091209 | 65274 | 5.8% |
| Other values (6) | 352636 |
| Value | Count | Frequency (%) |
| 2021090900 | 92644 | |
| 2021091200 | 69299 | |
| 2021091201 | 76406 | |
| 2021091202 | 68172 | |
| 2021091203 | 71116 | |
| 2021091204 | 82271 | |
| 2021091205 | 75739 | |
| 2021091206 | 63917 | |
| 2021091207 | 63848 | |
| 2021091208 | 64653 |
| Value | Count | Frequency (%) |
| 2021091300 | 92299 | |
| 2021091213 | 57799 | |
| 2021091212 | 72266 | |
| 2021091211 | 48323 | |
| 2021091210 | 54096 | |
| 2021091209 | 65274 | |
| 2021091208 | 64653 | |
| 2021091207 | 63848 | |
| 2021091206 | 63917 | |
| 2021091205 | 75739 |
playId
Real number (ℝ≥0)
| Distinct | 1021 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2187.559921 |
| Minimum | 55 |
| Maximum | 4849 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 55 |
|---|---|
| 5-th percentile | 262 |
| Q1 | 1166 |
| median | 2145 |
| Q3 | 3195 |
| 95-th percentile | 4233 |
| Maximum | 4849 |
| Range | 4794 |
| Interquartile range (IQR) | 2029 |
Descriptive statistics
| Standard deviation | 1243.918429 |
|---|---|
| Coefficient of variation (CV) | 0.5686328485 |
| Kurtosis | -1.009917699 |
| Mean | 2187.559921 |
| Median Absolute Deviation (MAD) | 1017 |
| Skewness | 0.1122809555 |
| Sum | 2445958874 |
| Variance | 1547333.058 |
| Monotonicity | Not monotonic |
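Several of the playId statistics above are derived from others in the same tables. As a rough consistency check, the sketch below recomputes the range, IQR, coefficient of variation and variance from the reported minimum, maximum, quartiles, mean and standard deviation; the variable names are illustrative only.

```python
# Values copied from the playId tables.
minimum, maximum = 55, 4849
q1, q3 = 1166, 3195
mean = 2187.559921
std = 1243.918429

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std / mean                   # CV = standard deviation / mean
variance = std ** 2               # Variance = standard deviation squared

print(value_range, iqr, round(cv, 6), round(variance, 1))
```

The same relationships hold for the other numeric variables in the report.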
| Value | Count | Frequency (%) |
| 2063 | 4991 | 0.4% |
| 3406 | 4968 | 0.4% |
| 2129 | 4669 | 0.4% |
| 1938 | 3312 | 0.3% |
| 620 | 3289 | 0.3% |
| 1988 | 3220 | 0.3% |
| 788 | 3220 | 0.3% |
| 76 | 3174 | 0.3% |
| 421 | 3013 | 0.3% |
| 2219 | 2944 | 0.3% |
| Other values (1011) | 1081322 |
| Value | Count | Frequency (%) |
| 55 | 713 | 0.1% |
| 56 | 1702 | |
| 63 | 736 | 0.1% |
| 69 | 736 | 0.1% |
| 76 | 3174 | |
| 77 | 1150 | 0.1% |
| 78 | 713 | 0.1% |
| 84 | 644 | 0.1% |
| 85 | 713 | 0.1% |
| 88 | 644 | 0.1% |
| Value | Count | Frequency (%) |
| 4849 | 1035 | |
| 4845 | 782 | |
| 4772 | 667 | |
| 4765 | 1081 | |
| 4750 | 805 | |
| 4736 | 805 | |
| 4728 | 690 | |
| 4699 | 874 | |
| 4695 | 713 | |
| 4691 | 966 |
nflId
Real number (ℝ≥0)
| Distinct | 1162 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 48614 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45460.32293 |
| Minimum | 25511 |
| Maximum | 53957 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 25511 |
|---|---|
| 5-th percentile | 37104 |
| Q1 | 42404 |
| median | 44999 |
| Q3 | 47917 |
| 95-th percentile | 53462 |
| Maximum | 53957 |
| Range | 28446 |
| Interquartile range (IQR) | 5513 |
Descriptive statistics
| Standard deviation | 4938.338482 |
|---|---|
| Coefficient of variation (CV) | 0.1086296393 |
| Kurtosis | 0.02710664664 |
| Mean | 45460.32293 |
| Median Absolute Deviation (MAD) | 2835 |
| Skewness | -0.1592549538 |
| Sum | 4.862017906 × 10¹⁰ |
| Variance | 24387186.96 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 46089 | 2700 | 0.2% |
| 43453 | 2700 | 0.2% |
| 53436 | 2700 | 0.2% |
| 52483 | 2700 | 0.2% |
| 48455 | 2700 | 0.2% |
| 43290 | 2700 | 0.2% |
| 53601 | 2672 | 0.2% |
| 46187 | 2440 | 0.2% |
| 46084 | 2440 | 0.2% |
| 52517 | 2440 | 0.2% |
| Other values (1152) | 1043316 | |
| (Missing) | 48614 | 4.3% |
| Value | Count | Frequency (%) |
| 25511 | 1690 | |
| 28963 | 1055 | |
| 29550 | 502 | < 0.1% |
| 29851 | 1071 | |
| 30078 | 334 | < 0.1% |
| 30842 | 249 | < 0.1% |
| 30869 | 1007 | |
| 33084 | 1728 | |
| 33107 | 1033 | |
| 33130 | 390 | < 0.1% |
| Value | Count | Frequency (%) |
| 53957 | 321 | < 0.1% |
| 53946 | 177 | < 0.1% |
| 53935 | 68 | < 0.1% |
| 53930 | 444 | < 0.1% |
| 53876 | 134 | < 0.1% |
| 53687 | 47 | < 0.1% |
| 53685 | 159 | < 0.1% |
| 53679 | 125 | < 0.1% |
| 53674 | 1963 | |
| 53668 | 489 | < 0.1% |
frameId
Real number (ℝ≥0)
| Distinct | 177 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.00878348 |
| Minimum | 1 |
| Maximum | 177 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 21 |
| Q3 | 32 |
| 95-th percentile | 50 |
| Maximum | 177 |
| Range | 176 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 16.12365472 |
|---|---|
| Coefficient of variation (CV) | 0.7007608524 |
| Kurtosis | 7.282689269 |
| Mean | 23.00878348 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.644070527 |
| Sum | 25726627 |
| Variance | 259.9722416 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 27025 | 2.4% |
| 2 | 27025 | 2.4% |
| 21 | 27025 | 2.4% |
| 20 | 27025 | 2.4% |
| 19 | 27025 | 2.4% |
| 18 | 27025 | 2.4% |
| 17 | 27025 | 2.4% |
| 16 | 27025 | 2.4% |
| 15 | 27025 | 2.4% |
| 14 | 27025 | 2.4% |
| Other values (167) | 847872 |
| Value | Count | Frequency (%) |
| 1 | 27025 | |
| 2 | 27025 | |
| 3 | 27025 | |
| 4 | 27025 | |
| 5 | 27025 | |
| 6 | 27025 | |
| 7 | 27025 | |
| 8 | 27025 | |
| 9 | 27025 | |
| 10 | 27025 |
| Value | Count | Frequency (%) |
| 177 | 23 | |
| 176 | 23 | |
| 175 | 23 | |
| 174 | 23 | |
| 173 | 23 | |
| 172 | 23 | |
| 171 | 23 | |
| 170 | 23 | |
| 169 | 23 | |
| 168 | 23 |
time
Categorical
| Distinct | 45406 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.5 MiB |
| Value | Count |
|---|---|
| 2021-09-12T18:04:04.900 | 69 |
| 2021-09-12T19:59:55.500 | 69 |
| 2021-09-12T19:59:55.300 | 69 |
| 2021-09-12T19:59:55.200 | 69 |
| 2021-09-12T22:46:40.400 | 69 |
| Other values (45401) | 1117777 |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 25716806 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2021-09-10T00:26:31.100 |
|---|---|
| 2nd row | 2021-09-10T00:26:31.200 |
| 3rd row | 2021-09-10T00:26:31.300 |
| 4th row | 2021-09-10T00:26:31.400 |
| 5th row | 2021-09-10T00:26:31.500 |
Common Values
| Value | Count | Frequency (%) |
| 2021-09-12T18:04:04.900 | 69 | < 0.1% |
| 2021-09-12T19:59:55.500 | 69 | < 0.1% |
| 2021-09-12T19:59:55.300 | 69 | < 0.1% |
| 2021-09-12T19:59:55.200 | 69 | < 0.1% |
| 2021-09-12T22:46:40.400 | 69 | < 0.1% |
| 2021-09-12T22:46:40.300 | 69 | < 0.1% |
| 2021-09-12T17:53:10.900 | 69 | < 0.1% |
| 2021-09-12T17:53:10.800 | 69 | < 0.1% |
| 2021-09-12T18:59:07.900 | 69 | < 0.1% |
| 2021-09-12T18:59:08.000 | 69 | < 0.1% |
| Other values (45396) | 1117432 |
[Figure: histogram of value lengths, not preserved]
Words
| Value | Count | Frequency (%) |
| 2021-09-12t18:04:04.900 | 69 | < 0.1% |
| 2021-09-12t17:28:55.000 | 69 | < 0.1% |
| 2021-09-12t17:28:54.800 | 69 | < 0.1% |
| 2021-09-12t17:28:54.600 | 69 | < 0.1% |
| 2021-09-12t17:28:54.400 | 69 | < 0.1% |
| 2021-09-12t17:28:53.200 | 69 | < 0.1% |
| 2021-09-12t17:28:54.300 | 69 | < 0.1% |
| 2021-09-12t17:28:54.200 | 69 | < 0.1% |
| 2021-09-12t17:28:54.100 | 69 | < 0.1% |
| 2021-09-12t17:28:54.000 | 69 | < 0.1% |
| Other values (45396) | 1117432 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5732773 | |
| 2 | 4323516 | |
| 1 | 3641593 | |
| - | 2236244 | 8.7% |
| : | 2236244 | 8.7% |
| 9 | 1664602 | 6.5% |
| T | 1118122 | 4.3% |
| . | 1118122 | 4.3% |
| 3 | 812383 | 3.2% |
| 4 | 797824 | 3.1% |
| Other values (4) | 2035383 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19008074 | |
| Other Punctuation | 3354366 | 13.0% |
| Dash Punctuation | 2236244 | 8.7% |
| Uppercase Letter | 1118122 | 4.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5732773 | |
| 2 | 4323516 | |
| 1 | 3641593 | |
| 9 | 1664602 | 8.8% |
| 3 | 812383 | 4.3% |
| 4 | 797824 | 4.2% |
| 5 | 695036 | 3.7% |
| 8 | 516764 | 2.7% |
| 7 | 492453 | 2.6% |
| 6 | 331130 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2236244 | |
| . | 1118122 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2236244 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1118122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24598684 | |
| Latin | 1118122 | 4.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5732773 | |
| 2 | 4323516 | |
| 1 | 3641593 | |
| - | 2236244 | 9.1% |
| : | 2236244 | 9.1% |
| 9 | 1664602 | 6.8% |
| . | 1118122 | 4.5% |
| 3 | 812383 | 3.3% |
| 4 | 797824 | 3.2% |
| 5 | 695036 | 2.8% |
| Other values (3) | 1340347 | 5.4% |
Latin
| Value | Count | Frequency (%) |
| T | 1118122 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25716806 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5732773 | |
| 2 | 4323516 | |
| 1 | 3641593 | |
| - | 2236244 | 8.7% |
| : | 2236244 | 8.7% |
| 9 | 1664602 | 6.5% |
| T | 1118122 | 4.3% |
| . | 1118122 | 4.3% |
| 3 | 812383 | 3.2% |
| 4 | 797824 | 3.1% |
| Other values (4) | 2035383 | 7.9% |
jerseyNumber
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48614 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.46293716 |
| Minimum | 1 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 22 |
| median | 52 |
| Q3 | 75 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 30.02460577 |
|---|---|
| Coefficient of variation (CV) | 0.6070121891 |
| Kurtosis | -1.347424287 |
| Mean | 49.46293716 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.03076410867 |
| Sum | 52901007 |
| Variance | 901.4769514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 27032 | 2.4% |
| 21 | 21718 | 1.9% |
| 11 | 21176 | 1.9% |
| 90 | 20588 | 1.8% |
| 24 | 20328 | 1.8% |
| 76 | 19610 | 1.8% |
| 74 | 19316 | 1.7% |
| 2 | 19227 | 1.7% |
| 26 | 18104 | 1.6% |
| 72 | 17986 | 1.6% |
| Other values (89) | 864423 | |
| (Missing) | 48614 | 4.3% |
| Value | Count | Frequency (%) |
| 1 | 12809 | |
| 2 | 19227 | |
| 3 | 7991 | |
| 4 | 11779 | |
| 5 | 7551 | 0.7% |
| 6 | 11202 | |
| 7 | 9156 | |
| 8 | 14850 | |
| 9 | 6177 | 0.6% |
| 10 | 14182 |
| Value | Count | Frequency (%) |
| 99 | 12525 | |
| 98 | 13925 | |
| 97 | 16611 | |
| 96 | 9809 | |
| 95 | 10636 | |
| 94 | 16522 | |
| 93 | 12138 | |
| 92 | 6339 | 0.6% |
| 91 | 14946 | |
| 90 | 20588 |
team
Categorical
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.5 MiB |
| Value | Count |
|---|---|
| football | 48614 |
| TB | 44308 |
| DAL | 44308 |
| BAL | 44143 |
| LV | 44143 |
| Other values (28) | 892606 |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 2.985973803 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3338683 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TB |
|---|---|
| 2nd row | TB |
| 3rd row | TB |
| 4th row | TB |
| 5th row | TB |
Common Values
| Value | Count | Frequency (%) |
| football | 48614 | 4.3% |
| TB | 44308 | 4.0% |
| DAL | 44308 | 4.0% |
| BAL | 44143 | 3.9% |
| LV | 44143 | 3.9% |
| SF | 39347 | 3.5% |
| DET | 39347 | 3.5% |
| PIT | 36542 | 3.3% |
| BUF | 36542 | 3.3% |
| HOU | 36223 | 3.2% |
| Other values (23) | 704605 |
[Figure: histogram of value lengths, not preserved]
Words
| Value | Count | Frequency (%) |
| football | 48614 | 4.3% |
| tb | 44308 | 4.0% |
| dal | 44308 | 4.0% |
| bal | 44143 | 3.9% |
| lv | 44143 | 3.9% |
| sf | 39347 | 3.5% |
| det | 39347 | 3.5% |
| pit | 36542 | 3.3% |
| buf | 36542 | 3.3% |
| hou | 36223 | 3.2% |
| Other values (23) | 704605 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 366883 | 11.0% |
| N | 279840 | 8.4% |
| L | 255519 | 7.7% |
| I | 252329 | 7.6% |
| E | 192104 | 5.8% |
| C | 187616 | 5.6% |
| T | 183876 | 5.5% |
| D | 148786 | 4.5% |
| B | 148104 | 4.4% |
| S | 100837 | 3.0% |
| Other values (20) | 1222789 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2949771 | |
| Lowercase Letter | 388912 | 11.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 366883 | |
| N | 279840 | 9.5% |
| L | 255519 | 8.7% |
| I | 252329 | 8.6% |
| E | 192104 | 6.5% |
| C | 187616 | 6.4% |
| T | 183876 | 6.2% |
| D | 148786 | 5.0% |
| B | 148104 | 5.0% |
| S | 100837 | 3.4% |
| Other values (14) | 833877 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 97228 | |
| o | 97228 | |
| f | 48614 | |
| a | 48614 | |
| b | 48614 | |
| t | 48614 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3338683 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 366883 | 11.0% |
| N | 279840 | 8.4% |
| L | 255519 | 7.7% |
| I | 252329 | 7.6% |
| E | 192104 | 5.8% |
| C | 187616 | 5.6% |
| T | 183876 | 5.5% |
| D | 148786 | 4.5% |
| B | 148104 | 4.4% |
| S | 100837 | 3.0% |
| Other values (20) | 1222789 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3338683 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 366883 | 11.0% |
| N | 279840 | 8.4% |
| L | 255519 | 7.7% |
| I | 252329 | 7.6% |
| E | 192104 | 5.8% |
| C | 187616 | 5.6% |
| T | 183876 | 5.5% |
| D | 148786 | 4.5% |
| B | 148104 | 4.4% |
| S | 100837 | 3.0% |
| Other values (20) | 1222789 |
playDirection
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.5 MiB |
| Value | Count |
|---|---|
| right | 568031 |
| left | 550091 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.50802238 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5040519 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | right |
|---|---|
| 2nd row | right |
| 3rd row | right |
| 4th row | right |
| 5th row | right |
Common Values
| Value | Count | Frequency (%) |
| right | 568031 | |
| left | 550091 |
[Figures: length histogram and category frequency plot, not preserved]
Words
| Value | Count | Frequency (%) |
| right | 568031 | |
| left | 550091 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1118122 | |
| r | 568031 | |
| i | 568031 | |
| g | 568031 | |
| h | 568031 | |
| l | 550091 | |
| e | 550091 | |
| f | 550091 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5040519 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1118122 | |
| r | 568031 | |
| i | 568031 | |
| g | 568031 | |
| h | 568031 | |
| l | 550091 | |
| e | 550091 | |
| f | 550091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5040519 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1118122 | |
| r | 568031 | |
| i | 568031 | |
| g | 568031 | |
| h | 568031 | |
| l | 550091 | |
| e | 550091 | |
| f | 550091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5040519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1118122 | |
| r | 568031 | |
| i | 568031 | |
| g | 568031 | |
| h | 568031 | |
| l | 550091 | |
| e | 550091 | |
| f | 550091 |
x
Real number (ℝ≥0)
| Distinct | 11708 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.95227535 |
| Minimum | 0.25 |
| Maximum | 119.72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0.25 |
|---|---|
| 5-th percentile | 20.72 |
| Q1 | 40.8 |
| median | 60.13 |
| Q3 | 78.87 |
| 95-th percentile | 99.6195 |
| Maximum | 119.72 |
| Range | 119.47 |
| Interquartile range (IQR) | 38.07 |
Descriptive statistics
| Standard deviation | 24.22567114 |
|---|---|
| Coefficient of variation (CV) | 0.4040825973 |
| Kurtosis | -0.8226969423 |
| Mean | 59.95227535 |
| Median Absolute Deviation (MAD) | 19.04 |
| Skewness | 0.009726454646 |
| Sum | 67033958.02 |
| Variance | 586.8831421 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 66.62 | 203 | < 0.1% |
| 86.45 | 203 | < 0.1% |
| 66.16 | 200 | < 0.1% |
| 86.55 | 200 | < 0.1% |
| 64.38 | 200 | < 0.1% |
| 66.76 | 198 | < 0.1% |
| 61.83 | 198 | < 0.1% |
| 67.19 | 197 | < 0.1% |
| 64.39 | 197 | < 0.1% |
| 61.29 | 196 | < 0.1% |
| Other values (11698) | 1116130 |
| Value | Count | Frequency (%) |
| 0.25 | 2 | |
| 0.26 | 2 | |
| 0.27 | 1 | |
| 0.28 | 1 | |
| 0.29 | 1 | |
| 0.31 | 1 | |
| 0.34 | 1 | |
| 0.38 | 1 | |
| 0.42 | 1 | |
| 0.49 | 1 |
| Value | Count | Frequency (%) |
| 119.72 | 1 | |
| 119.71 | 1 | |
| 119.68 | 1 | |
| 119.64 | 1 | |
| 119.6 | 1 | |
| 119.59 | 1 | |
| 119.52 | 2 | |
| 119.47 | 1 | |
| 119.42 | 1 | |
| 119.35 | 1 |
y
Real number (ℝ)
| Distinct | 5386 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.84171764 |
| Minimum | -2.61 |
| Maximum | 57.01 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 58 |
| Negative (%) | < 0.1% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | -2.61 |
|---|---|
| 5-th percentile | 11.39 |
| Q1 | 22.03 |
| median | 26.85 |
| Q3 | 31.71 |
| 95-th percentile | 42.16 |
| Maximum | 57.01 |
| Range | 59.62 |
| Interquartile range (IQR) | 9.68 |
Descriptive statistics
| Standard deviation | 8.385431814 |
|---|---|
| Coefficient of variation (CV) | 0.3124029515 |
| Kurtosis | 0.2898441369 |
| Mean | 26.84171764 |
| Median Absolute Deviation (MAD) | 4.84 |
| Skewness | -0.01626457147 |
| Sum | 30012315.01 |
| Variance | 70.3154667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.74 | 1085 | 0.1% |
| 23.75 | 1062 | 0.1% |
| 23.8 | 1058 | 0.1% |
| 23.9 | 1049 | 0.1% |
| 29.9 | 1039 | 0.1% |
| 23.88 | 1038 | 0.1% |
| 29.88 | 1034 | 0.1% |
| 23.83 | 1032 | 0.1% |
| 23.87 | 1031 | 0.1% |
| 23.85 | 1030 | 0.1% |
| Other values (5376) | 1107664 |
| Value | Count | Frequency (%) |
| -2.61 | 9 | |
| -2.6 | 3 | < 0.1% |
| -2.59 | 2 | < 0.1% |
| -2.58 | 3 | < 0.1% |
| -2.57 | 1 | < 0.1% |
| -2.56 | 1 | < 0.1% |
| -2.54 | 3 | < 0.1% |
| -2.53 | 3 | < 0.1% |
| -2.52 | 3 | < 0.1% |
| -2.51 | 5 |
| Value | Count | Frequency (%) |
| 57.01 | 1 | |
| 56.47 | 1 | |
| 56.44 | 1 | |
| 55.85 | 1 | |
| 55.82 | 1 | |
| 55.81 | 1 | |
| 55.71 | 1 | |
| 55.62 | 2 | |
| 55.6 | 1 | |
| 55.59 | 1 |
s
Real number (ℝ≥0)
| Distinct | 2178 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.574296061 |
| Minimum | 0 |
| Maximum | 28.3 |
| Zeros | 71193 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.73 |
| median | 2.13 |
| Q3 | 3.8 |
| 95-th percentile | 6.75 |
| Maximum | 28.3 |
| Range | 28.3 |
| Interquartile range (IQR) | 3.07 |
Descriptive statistics
| Standard deviation | 2.403109625 |
|---|---|
| Coefficient of variation (CV) | 0.9335016518 |
| Kurtosis | 15.28627553 |
| Mean | 2.574296061 |
| Median Absolute Deviation (MAD) | 1.51 |
| Skewness | 2.441052427 |
| Sum | 2878377.06 |
| Variance | 5.774935869 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 71193 | 6.4% |
| 0.01 | 18089 | 1.6% |
| 0.02 | 10548 | 0.9% |
| 0.03 | 7660 | 0.7% |
| 0.04 | 6304 | 0.6% |
| 0.05 | 5545 | 0.5% |
| 0.06 | 4823 | 0.4% |
| 0.07 | 4679 | 0.4% |
| 0.08 | 4283 | 0.4% |
| 0.09 | 4042 | 0.4% |
| Other values (2168) | 980956 |
| Value | Count | Frequency (%) |
| 0 | 71193 | |
| 0.01 | 18089 | 1.6% |
| 0.02 | 10548 | 0.9% |
| 0.03 | 7660 | 0.7% |
| 0.04 | 6304 | 0.6% |
| 0.05 | 5545 | 0.5% |
| 0.06 | 4823 | 0.4% |
| 0.07 | 4679 | 0.4% |
| 0.08 | 4283 | 0.4% |
| 0.09 | 4042 | 0.4% |
| Value | Count | Frequency (%) |
| 28.3 | 1 | |
| 28.21 | 1 | |
| 28.18 | 1 | |
| 28.04 | 1 | |
| 28.01 | 1 | |
| 27.96 | 1 | |
| 27.8 | 1 | |
| 27.76 | 1 | |
| 27.71 | 1 | |
| 27.61 | 1 |
a
Real number (ℝ≥0)
| Distinct | 1659 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.794617904 |
| Minimum | 0 |
| Maximum | 50.69 |
| Zeros | 66378 |
| Zeros (%) | 5.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.7 |
| median | 1.54 |
| Q3 | 2.59 |
| 95-th percentile | 4.47 |
| Maximum | 50.69 |
| Range | 50.69 |
| Interquartile range (IQR) | 1.89 |
Descriptive statistics
| Standard deviation | 1.458680584 |
|---|---|
| Coefficient of variation (CV) | 0.812808443 |
| Kurtosis | 9.463408966 |
| Mean | 1.794617904 |
| Median Absolute Deviation (MAD) | 0.93 |
| Skewness | 1.600224501 |
| Sum | 2006601.76 |
| Variance | 2.127749047 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 66378 | 5.9% |
| 0.01 | 14490 | 1.3% |
| 0.02 | 8486 | 0.8% |
| 0.03 | 6263 | 0.6% |
| 0.04 | 5177 | 0.5% |
| 0.05 | 4413 | 0.4% |
| 0.06 | 3711 | 0.3% |
| 1.01 | 3533 | 0.3% |
| 1.29 | 3502 | 0.3% |
| 1.1 | 3474 | 0.3% |
| Other values (1649) | 998695 |
| Value | Count | Frequency (%) |
| 0 | 66378 | |
| 0.01 | 14490 | 1.3% |
| 0.02 | 8486 | 0.8% |
| 0.03 | 6263 | 0.6% |
| 0.04 | 5177 | 0.5% |
| 0.05 | 4413 | 0.4% |
| 0.06 | 3711 | 0.3% |
| 0.07 | 3393 | 0.3% |
| 0.08 | 3012 | 0.3% |
| 0.09 | 2837 | 0.3% |
| Value | Count | Frequency (%) |
| 50.69 | 1 | |
| 36.39 | 1 | |
| 31.55 | 1 | |
| 28.87 | 1 | |
| 28.77 | 1 | |
| 28.48 | 1 | |
| 26.67 | 1 | |
| 26.2 | 1 | |
| 26.16 | 1 | |
| 25.88 | 1 |
dis
Real number (ℝ≥0)
| Distinct | 563 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2608324762 |
| Minimum | 0 |
| Maximum | 8.46 |
| Zeros | 70153 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.07 |
| median | 0.21 |
| Q3 | 0.38 |
| 95-th percentile | 0.68 |
| Maximum | 8.46 |
| Range | 8.46 |
| Interquartile range (IQR) | 0.31 |
Descriptive statistics
| Standard deviation | 0.2579967705 |
|---|---|
| Coefficient of variation (CV) | 0.9891282492 |
| Kurtosis | 54.10544083 |
| Mean | 0.2608324762 |
| Median Absolute Deviation (MAD) | 0.15 |
| Skewness | 4.408429249 |
| Sum | 291642.53 |
| Variance | 0.06656233361 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 70153 | 6.3% |
| 0.01 | 62703 | 5.6% |
| 0.02 | 35547 | 3.2% |
| 0.03 | 27544 | 2.5% |
| 0.04 | 23568 | 2.1% |
| 0.05 | 21796 | 1.9% |
| 0.18 | 20496 | 1.8% |
| 0.06 | 20488 | 1.8% |
| 0.19 | 20486 | 1.8% |
| 0.2 | 20453 | 1.8% |
| Other values (553) | 794888 |
| Value | Count | Frequency (%) |
| 0 | 70153 | |
| 0.01 | 62703 | |
| 0.02 | 35547 | |
| 0.03 | 27544 | 2.5% |
| 0.04 | 23568 | 2.1% |
| 0.05 | 21796 | 1.9% |
| 0.06 | 20488 | 1.8% |
| 0.07 | 19701 | 1.8% |
| 0.08 | 19521 | 1.7% |
| 0.09 | 19406 | 1.7% |
| Value | Count | Frequency (%) |
| 8.46 | 1 | |
| 7.98 | 1 | |
| 7.9 | 1 | |
| 7.31 | 1 | |
| 7 | 1 | |
| 6.86 | 1 | |
| 6.6 | 1 | |
| 6.55 | 1 | |
| 6.53 | 1 | |
| 6.4 | 1 |
o
Real number (ℝ≥0)
| Distinct | 36001 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 48614 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.6703195 |
| Minimum | 0 |
| Maximum | 360 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31.35 |
| Q1 | 89.38 |
| median | 178.995 |
| Q3 | 269.43 |
| 95-th percentile | 329.82 |
| Maximum | 360 |
| Range | 360 |
| Interquartile range (IQR) | 180.05 |
Descriptive statistics
| Standard deviation | 99.26149134 |
|---|---|
| Coefficient of variation (CV) | 0.5524646008 |
| Kurtosis | -1.369466549 |
| Mean | 179.6703195 |
| Median Absolute Deviation (MAD) | 90.015 |
| Skewness | 0.01038449911 |
| Sum | 192158844.1 |
| Variance | 9852.843663 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 1988 | 0.2% |
| 93.34 | 112 | < 0.1% |
| 82.82 | 106 | < 0.1% |
| 81.35 | 104 | < 0.1% |
| 84.84 | 103 | < 0.1% |
| 93.2 | 103 | < 0.1% |
| 89.76 | 103 | < 0.1% |
| 82.68 | 102 | < 0.1% |
| 266.07 | 102 | < 0.1% |
| 84.26 | 101 | < 0.1% |
| Other values (35991) | 1066584 | |
| (Missing) | 48614 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 10 | |
| 0.01 | 20 | |
| 0.02 | 10 | |
| 0.03 | 21 | |
| 0.04 | 19 | |
| 0.05 | 18 | |
| 0.06 | 11 | |
| 0.07 | 17 | |
| 0.08 | 18 | |
| 0.09 | 21 |
| Value | Count | Frequency (%) |
| 360 | 13 | |
| 359.99 | 18 | |
| 359.98 | 24 | |
| 359.97 | 13 | |
| 359.96 | 13 | |
| 359.95 | 18 | |
| 359.94 | 13 | |
| 359.93 | 20 | |
| 359.92 | 19 | |
| 359.91 | 13 |
dir
Real number (ℝ≥0)
| Distinct | 36001 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 48614 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.8124916 |
| Minimum | 0 |
| Maximum | 360 |
| Zeros | 56 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 24.08 |
| Q1 | 91.03 |
| median | 179.87 |
| Q3 | 270.92 |
| 95-th percentile | 337.19 |
| Maximum | 360 |
| Range | 360 |
| Interquartile range (IQR) | 179.89 |
Descriptive statistics
| Standard deviation | 101.1737089 |
|---|---|
| Coefficient of variation (CV) | 0.5595504378 |
| Kurtosis | -1.290945778 |
| Mean | 180.8124916 |
| Median Absolute Deviation (MAD) | 89.93 |
| Skewness | -7.171900654 × 10⁻⁶ |
| Sum | 193380406.3 |
| Variance | 10236.11936 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98.01 | 75 | < 0.1% |
| 277.67 | 74 | < 0.1% |
| 89 | 73 | < 0.1% |
| 266.29 | 72 | < 0.1% |
| 93.55 | 71 | < 0.1% |
| 95.14 | 71 | < 0.1% |
| 268.94 | 71 | < 0.1% |
| 275.12 | 71 | < 0.1% |
| 96.86 | 70 | < 0.1% |
| 263.51 | 70 | < 0.1% |
| Other values (35991) | 1068790 | |
| (Missing) | 48614 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 56 | |
| 0.01 | 20 | < 0.1% |
| 0.02 | 31 | |
| 0.03 | 28 | |
| 0.04 | 23 | |
| 0.05 | 22 | < 0.1% |
| 0.06 | 20 | < 0.1% |
| 0.07 | 26 | |
| 0.08 | 24 | |
| 0.09 | 21 | < 0.1% |
| Value | Count | Frequency (%) |
| 360 | 13 | |
| 359.99 | 18 | |
| 359.98 | 21 | |
| 359.97 | 25 | |
| 359.96 | 29 | |
| 359.95 | 18 | |
| 359.94 | 28 | |
| 359.93 | 31 | |
| 359.92 | 22 | |
| 359.91 | 18 |
event
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.5 MiB |
| Value | Count |
|---|---|
| None | 1028537 |
| ball_snap | 26956 |
| pass_forward | 24127 |
| autoevent_ballsnap | 13455 |
| autoevent_passforward | 12627 |
| Other values (18) | 12420 |
Length
| Max length | 25 |
|---|---|
| Median length | 4 |
| Mean length | 4.72758876 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5286021 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 1028537 | |
| ball_snap | 26956 | 2.4% |
| pass_forward | 24127 | 2.2% |
| autoevent_ballsnap | 13455 | 1.2% |
| autoevent_passforward | 12627 | 1.1% |
| play_action | 5382 | 0.5% |
| qb_sack | 1334 | 0.1% |
| run | 1219 | 0.1% |
| pass_arrived | 1150 | 0.1% |
| autoevent_passinterrupted | 690 | 0.1% |
| Other values (13) | 2645 | 0.2% |
[Figure: histogram of value lengths, not preserved]
Words
| Value | Count | Frequency (%) |
| none | 1028537 | |
| ball_snap | 26956 | 2.4% |
| pass_forward | 24127 | 2.2% |
| autoevent_ballsnap | 13455 | 1.2% |
| autoevent_passforward | 12627 | 1.1% |
| play_action | 5382 | 0.5% |
| qb_sack | 1334 | 0.1% |
| run | 1219 | 0.1% |
| pass_arrived | 1150 | 0.1% |
| autoevent_passinterrupted | 690 | 0.1% |
| Other values (13) | 2645 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1105794 | |
| o | 1099515 | |
| e | 1086796 | |
| N | 1028537 | |
| a | 197869 | 3.7% |
| s | 121578 | 2.3% |
| _ | 88872 | 1.7% |
| l | 87032 | 1.6% |
| p | 86503 | 1.6% |
| r | 78913 | 1.5% |
| Other values (15) | 304612 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4168612 | |
| Uppercase Letter | 1028537 | 19.5% |
| Connector Punctuation | 88872 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1105794 | |
| o | 1099515 | |
| e | 1086796 | |
| a | 197869 | 4.7% |
| s | 121578 | 2.9% |
| l | 87032 | 2.1% |
| p | 86503 | 2.1% |
| r | 78913 | 1.9% |
| t | 63503 | 1.5% |
| b | 42021 | 1.0% |
| Other values (13) | 199088 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1028537 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 88872 |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 5197149 | 98.3% |
| Common | 88872 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1105794 | 21.3% |
| o | 1099515 | 21.2% |
| e | 1086796 | 20.9% |
| N | 1028537 | 19.8% |
| a | 197869 | 3.8% |
| s | 121578 | 2.3% |
| l | 87032 | 1.7% |
| p | 86503 | 1.7% |
| r | 78913 | 1.5% |
| t | 63503 | 1.2% |
| Other values (14) | 241109 | 4.6% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| _ | 88872 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 5286021 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1105794 | 20.9% |
| o | 1099515 | 20.8% |
| e | 1086796 | 20.6% |
| N | 1028537 | 19.5% |
| a | 197869 | 3.7% |
| s | 121578 | 2.3% |
| _ | 88872 | 1.7% |
| l | 87032 | 1.6% |
| p | 86503 | 1.6% |
| r | 78913 | 1.5% |
| Other values (15) | 304612 | 5.8% |
Auto
The auto setting is an easily interpretable pairwise column metric that selects a method from the types of the two columns, using the mapping vartype-vartype : method. The mapping is categorical-categorical : Cramér's V, numerical-categorical : Cramér's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the most suitable method for each pair of columns.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
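The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report itself runs: the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, and ties are assumed to receive the mean of the rank positions they occupy.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to the end of the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but strongly nonlinear relationship (y = x cubed).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print("Spearman:", spearman_rho(x, y))
print("Pearson: ", pearson_r(x, y))
```

On this toy data the ranks of x and y are identical, so ρ reaches its maximum, while Pearson's r on the raw values stays below it because the relationship is not linear.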
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
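As a small sketch of this formula and of the invariance property, the snippet below computes r for the `s` and `dis` values from the ten first rows shown in this report, then again after shifting and rescaling `s`. The function name `pearson_r` and the constants 3.0 and 10.0 are illustrative assumptions, and ten frames from one player say nothing reliable about the full dataset.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors of the covariance and the standard deviations cancel.
    return cov / sqrt(var_x * var_y)


# s (speed) and dis (distance) from the first ten rows of the report.
s = [0.29, 0.23, 0.16, 0.15, 0.25, 0.35, 0.54, 0.80, 0.99, 1.19]
dis = [0.03, 0.02, 0.01, 0.06, 0.04, 0.05, 0.08, 0.09, 0.09, 0.11]

r = pearson_r(s, dis)
# Changing location (+10.0) and scale (x3.0) of one variable leaves r unchanged.
r_rescaled = pearson_r([3.0 * v + 10.0 for v in s], dis)
print(round(r, 3), round(r_rescaled, 3))
```

The two printed values should agree up to floating-point error, which is what invariance under location and scale means in practice.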
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
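The pair-counting definition above corresponds to the variant usually called tau-a, which can be sketched directly. The name `kendall_tau_a` is hypothetical, and library implementations often apply a tie correction (tau-b), so their results may differ from this sketch when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables order the pair the same way
        elif product < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # product == 0 means a tie in x or y; it counts toward neither group
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

This brute-force version compares every pair, so its cost grows quadratically with the number of observations; it is meant to show the definition, not to be run on the full 1118122 rows.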
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | gameId | playId | nflId | frameId | time | jerseyNumber | team | playDirection | x | y | s | a | dis | o | dir | event |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2021090900 | 97 | 25511.0 | 1 | 2021-09-10T00:26:31.100 | 12.0 | TB | right | 37.77 | 24.22 | 0.29 | 0.30 | 0.03 | 165.16 | 84.99 | None |
| 1 | 2021090900 | 97 | 25511.0 | 2 | 2021-09-10T00:26:31.200 | 12.0 | TB | right | 37.78 | 24.22 | 0.23 | 0.11 | 0.02 | 164.33 | 92.87 | None |
| 2 | 2021090900 | 97 | 25511.0 | 3 | 2021-09-10T00:26:31.300 | 12.0 | TB | right | 37.78 | 24.24 | 0.16 | 0.10 | 0.01 | 160.24 | 68.55 | None |
| 3 | 2021090900 | 97 | 25511.0 | 4 | 2021-09-10T00:26:31.400 | 12.0 | TB | right | 37.73 | 24.25 | 0.15 | 0.24 | 0.06 | 152.13 | 296.85 | None |
| 4 | 2021090900 | 97 | 25511.0 | 5 | 2021-09-10T00:26:31.500 | 12.0 | TB | right | 37.69 | 24.26 | 0.25 | 0.18 | 0.04 | 148.33 | 287.55 | None |
| 5 | 2021090900 | 97 | 25511.0 | 6 | 2021-09-10T00:26:31.600 | 12.0 | TB | right | 37.64 | 24.26 | 0.35 | 0.53 | 0.05 | 144.42 | 282.72 | ball_snap |
| 6 | 2021090900 | 97 | 25511.0 | 7 | 2021-09-10T00:26:31.700 | 12.0 | TB | right | 37.56 | 24.26 | 0.54 | 1.05 | 0.08 | 137.49 | 272.95 | None |
| 7 | 2021090900 | 97 | 25511.0 | 8 | 2021-09-10T00:26:31.800 | 12.0 | TB | right | 37.47 | 24.25 | 0.80 | 1.85 | 0.09 | 131.95 | 267.49 | None |
| 8 | 2021090900 | 97 | 25511.0 | 9 | 2021-09-10T00:26:31.900 | 12.0 | TB | right | 37.38 | 24.24 | 0.99 | 2.03 | 0.09 | 129.85 | 263.48 | None |
| 9 | 2021090900 | 97 | 25511.0 | 10 | 2021-09-10T00:26:32.000 | 12.0 | TB | right | 37.27 | 24.23 | 1.19 | 1.82 | 0.11 | 123.79 | 263.77 | None |
Last rows
| | gameId | playId | nflId | frameId | time | jerseyNumber | team | playDirection | x | y | s | a | dis | o | dir | event |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1118112 | 2021091300 | 4845 | NaN | 25 | 2021-09-14T03:54:20.100 | NaN | football | left | 50.84 | 24.73 | 4.30 | 0.46 | 0.46 | NaN | NaN | None |
| 1118113 | 2021091300 | 4845 | NaN | 26 | 2021-09-14T03:54:20.200 | NaN | football | left | 51.25 | 24.83 | 4.25 | 1.15 | 0.43 | NaN | NaN | None |
| 1118114 | 2021091300 | 4845 | NaN | 27 | 2021-09-14T03:54:20.300 | NaN | football | left | 51.67 | 24.93 | 4.14 | 1.83 | 0.42 | NaN | NaN | None |
| 1118115 | 2021091300 | 4845 | NaN | 28 | 2021-09-14T03:54:20.400 | NaN | football | left | 52.06 | 25.03 | 3.96 | 1.93 | 0.40 | NaN | NaN | None |
| 1118116 | 2021091300 | 4845 | NaN | 29 | 2021-09-14T03:54:20.500 | NaN | football | left | 52.43 | 25.13 | 3.77 | 1.98 | 0.39 | NaN | NaN | autoevent_passforward |
| 1118117 | 2021091300 | 4845 | NaN | 30 | 2021-09-14T03:54:20.600 | NaN | football | left | 52.78 | 25.23 | 3.58 | 1.95 | 0.37 | NaN | NaN | pass_forward |
| 1118118 | 2021091300 | 4845 | NaN | 31 | 2021-09-14T03:54:20.700 | NaN | football | left | 50.31 | 26.46 | 17.16 | 0.25 | 2.77 | NaN | NaN | None |
| 1118119 | 2021091300 | 4845 | NaN | 32 | 2021-09-14T03:54:20.800 | NaN | football | left | 48.66 | 26.99 | 17.10 | 1.05 | 1.73 | NaN | NaN | None |
| 1118120 | 2021091300 | 4845 | NaN | 33 | 2021-09-14T03:54:20.900 | NaN | football | left | 47.04 | 27.53 | 16.98 | 1.67 | 1.71 | NaN | NaN | None |
| 1118121 | 2021091300 | 4845 | NaN | 34 | 2021-09-14T03:54:21.000 | NaN | football | left | 45.42 | 28.08 | 16.89 | 1.82 | 1.71 | NaN | NaN | None |